Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 1670995 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 495.5 MiB |
| Average record size in memory | 311.0 B |
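The two memory figures above can be cross-checked against each other: the average record size should be roughly the total size divided by the number of observations. A minimal sketch, assuming MiB means 2**20 bytes (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics table.
total_mib = 495.5      # "Total size in memory"
n_rows = 1_670_995     # "Number of observations"

# Assumption: 1 MiB = 2**20 bytes.
total_bytes = total_mib * 2**20
avg_record_bytes = total_bytes / n_rows

print(f"{avg_record_bytes:.1f} B per record")
```

The result lands within about 0.1 B of the reported 311.0 B; the small gap is presumably due to rounding of the total size in the report.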
Variable types
| NUM | 10 |
|---|---|
| CAT | 3 |
Reproduction
| Analysis started | 2020-05-25 10:40:53.484613 |
|---|---|
| Analysis finished | 2020-05-25 10:43:42.476667 |
| Duration | 2 minutes and 48.99 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
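The command line above has a rough equivalent in the Python API. The sketch below assumes pandas and pandas-profiling v2.x are installed and that `ProfileReport` accepts a `config_file` argument in that version; the function name, the default paths and the output file name are hypothetical.

```python
from pathlib import Path


def build_report(csv_path, config_path="config.yaml", out_path="report.html"):
    """Profile a CSV file and write an HTML report (sketch, see assumptions)."""
    # Imported lazily so this module loads even where the libraries are absent.
    import pandas as pd
    from pandas_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # Assumption: config_path is the same config.yaml passed on the command line.
    report = ProfileReport(df, config_file=config_path)
    report.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("YOUR_FILE.csv")` would then be expected to produce a report comparable to this one, provided the same configuration file is used.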
Warnings
| Warning | Type |
|---|---|
| brand has constant value "Globe Postpaid" | Constant |
| clusterId has a high cardinality: 149 distinct values | High cardinality |
| dataRevenue is highly correlated with dataRevenuePredicted | High correlation |
| dataRevenuePredicted is highly correlated with dataRevenue | High correlation |
| smsRevenue is highly correlated with smsRevenuePredicted | High correlation |
| smsRevenuePredicted is highly correlated with smsRevenue | High correlation |
| data is highly skewed (γ1 = 31.9875023) | Skewed |
| sms is highly skewed (γ1 = 126.3236899) | Skewed |
| subsId has unique values | Unique |
| data has 141095 (8.4%) zeros | Zeros |
| voice has 109681 (6.6%) zeros | Zeros |
| sms has 59673 (3.6%) zeros | Zeros |
| dataRevenuePredicted has 219596 (13.1%) zeros | Zeros |
| voiceRevenuePredicted has 222363 (13.3%) zeros | Zeros |
| smsRevenuePredicted has 296058 (17.7%) zeros | Zeros |
| dataRevenue has 219596 (13.1%) zeros | Zeros |
| voiceRevenue has 222363 (13.3%) zeros | Zeros |
| smsRevenue has 296058 (17.7%) zeros | Zeros |
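The zero percentages in the warnings follow directly from the zero counts and the number of observations. A short sketch that recomputes them (the dictionary layout is illustrative; the counts are copied from the warnings, and the actual revenue columns share the counts of their predicted counterparts):

```python
n_rows = 1_670_995  # "Number of observations"

# Zero counts as listed in the warnings.
zero_counts = {
    "data": 141_095,
    "voice": 109_681,
    "sms": 59_673,
    "dataRevenuePredicted": 219_596,
    "voiceRevenuePredicted": 222_363,
    "smsRevenuePredicted": 296_058,
}

# Percentage of rows that are exactly zero, rounded as in the report.
zero_pct = {name: round(100 * count / n_rows, 1) for name, count in zero_counts.items()}

for name, pct in zero_pct.items():
    print(f"{name}: {pct}%")
```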
subsId
| Distinct count | 1670995 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.7 MiB |
| Value | Count | Frequency (%) | |
| J41EATby/kOcRuBqMAKDnZ2d/+6auWRl | 1 | < 0.1% | |
| J41EATby/kOcRuBtNAT8o5rA+JLQuWRl | 1 | < 0.1% | |
| J41EATby/kOcD+hsPkqLnZq3+JrRn2Rl | 1 | < 0.1% | |
| J41EATby/kOcRuBqMDuPhpqL3o6buWRl | 1 | < 0.1% | |
| J41EATby/kOcD+htMDuPppqy3pKauWRl | 1 | < 0.1% | |
| J41EATby/kOcD5NqMAKTgJq13pLRn2Rl | 1 | < 0.1% | |
| J41EATby/kOcRuBtNDj8hpq13o6buWRl | 1 | < 0.1% | |
| J41EATby/kOcAN5sPlmDkZ2d+KabuWRl | 1 | < 0.1% | |
| J41EATby/kOcD+hqNAT8g5qJ7IqauWRl | 1 | < 0.1% | |
| J41EATby/kOcD+BsMlmThJqL3rKauWRl | 1 | < 0.1% | |
| J41EATby/kOcD+BsMlmDlJ2d+JrAuWRl | 1 | < 0.1% | |
| J41EATby/kOcD5NsNDj8nZq3+I6buWRl | 1 | < 0.1% | |
| J41EATby/kOcRuBsMASLkZqJ7JLQn2Rl | 1 | < 0.1% | |
| J41EATby/kOcRuBsNAKDgZqI3pqauWRl | 1 | < 0.1% | |
| J41EATby/kOcAM5tPkqhg5qz7LLAuWRl | 1 | < 0.1% | |
| J41EATby/kOcRuBsMAeho5qL3o7QuWRl | 1 | < 0.1% | |
| J41EATby/kOcD+hqNAT8kZqL3o7Rn2Rl | 1 | < 0.1% | |
| J41EATby/kOcD5NsPk6LkZq3+IrQj2Rl | 1 | < 0.1% | |
| J41EATby/kOcRuBtPlmTgZqJ7LzQuWRl | 1 | < 0.1% | |
| J41EATby/kOcAM5tPlmTo5qJ7LybuWRl | 1 | < 0.1% | |
| J41EATby/kOcRuBtPkr8gZq13o7QuWRl | 1 | < 0.1% | |
| J41EATby/kOcD+BsMDuPkZqz4+7Rj2Rl | 1 | < 0.1% | |
| J41EATby/kOcRuBsPlmDppq13rLRj2Rl | 1 | < 0.1% | |
| J41EATby/kOcAN5tMAShhpqz7I7Bj2Rl | 1 | < 0.1% | |
| J41EATby/kOcD+BsMlmDgZqJ4+7Qj2Rl | 1 | < 0.1% | |
| Other values (1670970) | 1670970 | > 99.9% |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Most occurring characters
| Value | Count | Frequency (%) | |
| A | 3161844 | 5.9% | |
| J | 2671160 | 5.0% | |
| R | 2499840 | 4.7% | |
| k | 2142413 | 4.0% | |
| b | 2101290 | 3.9% | |
| l | 2081244 | 3.9% | |
| y | 1954147 | 3.7% | |
| T | 1930444 | 3.6% | |
| q | 1922224 | 3.6% | |
| 1 | 1841600 | 3.4% | |
| 4 | 1727152 | 3.2% | |
| E | 1672900 | 3.1% | |
| O | 1663464 | 3.1% | |
| c | 1660444 | 3.1% | |
| / | 1629504 | 3.0% | |
| u | 1250999 | 2.3% | |
| D | 1204572 | 2.3% | |
| 2 | 1198164 | 2.2% | |
| 5 | 1082192 | 2.0% | |
| M | 1064170 | 2.0% | |
| + | 1026342 | 1.9% | |
| N | 992995 | 1.9% | |
| h | 981954 | 1.8% | |
| L | 952937 | 1.8% | |
| s | 909514 | 1.7% | |
| Other values (27) | 12148331 | 22.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 22475360 | 42.0% | |
| Lowercase Letter | 20240847 | 37.9% | |
| Decimal Number | 8099787 | 15.1% | |
| Other Punctuation | 1629504 | 3.0% | |
| Math Symbol | 1026342 | 1.9% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 3161844 | 14.1% | |
| J | 2671160 | 11.9% | |
| R | 2499840 | 11.1% | |
| T | 1930444 | 8.6% | |
| E | 1672900 | 7.4% | |
| O | 1663464 | 7.4% | |
| D | 1204572 | 5.4% | |
| M | 1064170 | 4.7% | |
| N | 992995 | 4.4% | |
| L | 952937 | 4.2% | |
| B | 849211 | 3.8% | |
| W | 819189 | 3.6% | |
| Q | 667199 | 3.0% | |
| Z | 649188 | 2.9% | |
| P | 477110 | 2.1% | |
| K | 388422 | 1.7% | |
| I | 386512 | 1.7% | |
| S | 357295 | 1.6% | |
| X | 64519 | 0.3% | |
| G | 2389 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 1841600 | 22.7% | |
| 4 | 1727152 | 21.3% | |
| 2 | 1198164 | 14.8% | |
| 5 | 1082192 | 13.4% | |
| 7 | 856502 | 10.6% | |
| 3 | 834307 | 10.3% | |
| 8 | 323639 | 4.0% | |
| 6 | 236231 | 2.9% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| k | 2142413 | 10.6% | |
| b | 2101290 | 10.4% | |
| l | 2081244 | 10.3% | |
| y | 1954147 | 9.7% | |
| q | 1922224 | 9.5% | |
| c | 1660444 | 8.2% | |
| u | 1250999 | 6.2% | |
| h | 981954 | 4.9% | |
| s | 909514 | 4.5% | |
| p | 763570 | 3.8% | |
| r | 630460 | 3.1% | |
| j | 606297 | 3.0% | |
| g | 556282 | 2.7% | |
| t | 510393 | 2.5% | |
| n | 482220 | 2.4% | |
| z | 316652 | 1.6% | |
| o | 291752 | 1.4% | |
| e | 269780 | 1.3% | |
| x | 256174 | 1.3% | |
| a | 234519 | 1.2% | |
| d | 176610 | 0.9% | |
| m | 141909 | 0.7% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 1629504 | 100.0% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 1026342 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 42716207 | 79.9% | |
| Common | 10755633 | 20.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| A | 3161844 | 7.4% | |
| J | 2671160 | 6.3% | |
| R | 2499840 | 5.9% | |
| k | 2142413 | 5.0% | |
| b | 2101290 | 4.9% | |
| l | 2081244 | 4.9% | |
| y | 1954147 | 4.6% | |
| T | 1930444 | 4.5% | |
| q | 1922224 | 4.5% | |
| E | 1672900 | 3.9% | |
| O | 1663464 | 3.9% | |
| c | 1660444 | 3.9% | |
| u | 1250999 | 2.9% | |
| D | 1204572 | 2.8% | |
| M | 1064170 | 2.5% | |
| N | 992995 | 2.3% | |
| h | 981954 | 2.3% | |
| L | 952937 | 2.2% | |
| s | 909514 | 2.1% | |
| B | 849211 | 2.0% | |
| W | 819189 | 1.9% | |
| p | 763570 | 1.8% | |
| Q | 667199 | 1.6% | |
| Z | 649188 | 1.5% | |
| r | 630460 | 1.5% | |
| Other values (17) | 5518835 | 12.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 1841600 | 17.1% | |
| 4 | 1727152 | 16.1% | |
| / | 1629504 | 15.2% | |
| 2 | 1198164 | 11.1% | |
| 5 | 1082192 | 10.1% | |
| + | 1026342 | 9.5% | |
| 7 | 856502 | 8.0% | |
| 3 | 834307 | 7.8% | |
| 8 | 323639 | 3.0% | |
| 6 | 236231 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 53471840 | 100.0% |
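The character tables for this column are mutually consistent, which can be verified with a little arithmetic: every value is 32 characters long, so the total character count should equal 32 times the number of observations, and the per-category counts should add up to the same total. A minimal sketch using the figures above (names are illustrative):

```python
n_rows = 1_670_995   # number of observations
id_length = 32       # min, median, mean and max length are all 32

# Counts copied from "Most occurring categories".
category_counts = {
    "Uppercase Letter": 22_475_360,
    "Lowercase Letter": 20_240_847,
    "Decimal Number": 8_099_787,
    "Other Punctuation": 1_629_504,
    "Math Symbol": 1_026_342,
}

total_chars = sum(category_counts.values())
letters = category_counts["Uppercase Letter"] + category_counts["Lowercase Letter"]

print(total_chars == n_rows * id_length)  # total matches the ASCII block count
print(letters)                            # should equal the "Latin" script count
```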
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| A | 3161844 | 5.9% | |
| J | 2671160 | 5.0% | |
| R | 2499840 | 4.7% | |
| k | 2142413 | 4.0% | |
| b | 2101290 | 3.9% | |
| l | 2081244 | 3.9% | |
| y | 1954147 | 3.7% | |
| T | 1930444 | 3.6% | |
| q | 1922224 | 3.6% | |
| 1 | 1841600 | 3.4% | |
| 4 | 1727152 | 3.2% | |
| E | 1672900 | 3.1% | |
| O | 1663464 | 3.1% | |
| c | 1660444 | 3.1% | |
| / | 1629504 | 3.0% | |
| u | 1250999 | 2.3% | |
| D | 1204572 | 2.3% | |
| 2 | 1198164 | 2.2% | |
| 5 | 1082192 | 2.0% | |
| M | 1064170 | 2.0% | |
| + | 1026342 | 1.9% | |
| N | 992995 | 1.9% | |
| h | 981954 | 1.8% | |
| L | 952937 | 1.8% | |
| s | 909514 | 1.7% | |
| Other values (27) | 12148331 | 22.7% |
data
| Distinct count | 1520586 |
|---|---|
| Unique (%) | 91.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6944.327708413956 |
| Minimum | 0.0 |
| Maximum | 2139749.193825725 |
| Zeros | 141095 |
| Zeros (%) | 8.4% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 615.6311697 |
| median | 3182.976384 |
| Q3 | 8467.330739 |
| 95-th percentile | 23514.35447 |
| Maximum | 2139749.194 |
| Range | 2139749.194 |
| Interquartile range (IQR) | 7851.69957 |
Descriptive statistics
| Standard deviation | 18262.48613 |
|---|---|
| Coefficient of variation (CV) | 2.629842211 |
| Kurtosis | 1982.225868 |
| Mean | 6944.327708 |
| Median Absolute Deviation (MAD) | 3042.76966 |
| Skewness | 31.9875023 |
| Sum | 1.160393688e+10 |
| Variance | 333518399.7 |
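Several of the figures in the quantile and descriptive tables are derived from others: the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below recomputes them from the reported values as a consistency check; the helper name `derive` and the dictionary keys are hypothetical.

```python
import math

# Values copied from the quantile and descriptive statistics tables above.
stats = {
    "n": 1_670_995,
    "mean": 6944.327708,
    "std": 18262.48613,
    "q1": 615.6311697,
    "q3": 8467.330739,
    "min": 0.0,
    "max": 2139749.194,
}

reported = {
    "iqr": 7851.69957,
    "cv": 2.629842211,
    "variance": 333518399.7,
    "range": 2139749.194,
    "sum": 1.160393688e10,
}


def derive(s):
    """Recompute the derived statistics from the primary ones."""
    return {
        "iqr": s["q3"] - s["q1"],
        "cv": s["std"] / s["mean"],
        "variance": s["std"] ** 2,
        "range": s["max"] - s["min"],
        "sum": s["mean"] * s["n"],
    }


derived = derive(stats)
for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.6g} reported={reported[name]:.6g} match={ok}")
```

The same relations apply to the other numeric variables in the report, so the dictionaries can be swapped out to check any of them.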
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 141095 | 8.4% | |
| 0.0002 | 613 | < 0.1% | |
| 0.0001 | 416 | < 0.1% | |
| 0.0003 | 275 | < 0.1% | |
| 0.0004 | 272 | < 0.1% | |
| 0.0005 | 215 | < 0.1% | |
| 0.0006 | 195 | < 0.1% | |
| 0.0007 | 166 | < 0.1% | |
| 0.001 | 119 | < 0.1% | |
| 0.0011 | 113 | < 0.1% | |
| 0.0008 | 105 | < 0.1% | |
| 0.0014 | 99 | < 0.1% | |
| 0.0009 | 95 | < 0.1% | |
| 0.0015 | 89 | < 0.1% | |
| 0.0012 | 84 | < 0.1% | |
| 0.0006 | 74 | < 0.1% | |
| 0.0013 | 69 | < 0.1% | |
| 0.0018 | 60 | < 0.1% | |
| 0.0016 | 60 | < 0.1% | |
| 0.0012 | 60 | < 0.1% | |
| 0.002 | 59 | < 0.1% | |
| 0.0003 | 57 | < 0.1% | |
| 0.0021 | 56 | < 0.1% | |
| 0.0023 | 53 | < 0.1% | |
| 0.0026 | 48 | < 0.1% | |
| Other values (1520561) | 1526448 | 91.3% |
| Value | Count | Frequency (%) | |
| 0 | 141095 | 8.4% | |
| 4.067632434e-06 | 1 | < 0.1% | |
| 1.342487536e-05 | 1 | < 0.1% | |
| 1.516539208e-05 | 1 | < 0.1% | |
| 1.691178812e-05 | 1 | < 0.1% | |
| 1.809773927e-05 | 1 | < 0.1% | |
| 1.832454513e-05 | 1 | < 0.1% | |
| 2.103087305e-05 | 1 | < 0.1% | |
| 2.104947636e-05 | 1 | < 0.1% | |
| 2.263261949e-05 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2139749.194 | 1 | < 0.1% | |
| 2075870.408 | 1 | < 0.1% | |
| 2074207.292 | 1 | < 0.1% | |
| 2056893.437 | 1 | < 0.1% | |
| 1925177.265 | 1 | < 0.1% | |
| 1837929.954 | 1 | < 0.1% | |
| 1805984.798 | 1 | < 0.1% | |
| 1798152.12 | 1 | < 0.1% | |
| 1753827.461 | 1 | < 0.1% | |
| 1743709.275 | 1 | < 0.1% |
voice
| Distinct count | 6299 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 274.3947707802836 |
| Minimum | 0.0 |
| Maximum | 28011.0 |
| Zeros | 109681 |
| Zeros (%) | 6.6% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 37 |
| median | 129 |
| Q3 | 330 |
| 95-th percentile | 990 |
| Maximum | 28011 |
| Range | 28011 |
| Interquartile range (IQR) | 293 |
Descriptive statistics
| Standard deviation | 481.0275064 |
|---|---|
| Coefficient of variation (CV) | 1.753049102 |
| Kurtosis | 163.2474291 |
| Mean | 274.3947708 |
| Median Absolute Deviation (MAD) | 111 |
| Skewness | 8.362652881 |
| Sum | 458512290 |
| Variance | 231387.4619 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 109681 | 6.6% | |
| 1 | 15671 | 0.9% | |
| 2 | 13311 | 0.8% | |
| 3 | 11786 | 0.7% | |
| 4 | 10789 | 0.6% | |
| 5 | 10340 | 0.6% | |
| 6 | 10009 | 0.6% | |
| 7 | 9657 | 0.6% | |
| 8 | 9146 | 0.5% | |
| 9 | 9144 | 0.5% | |
| 10 | 8788 | 0.5% | |
| 11 | 8656 | 0.5% | |
| 13 | 8567 | 0.5% | |
| 12 | 8496 | 0.5% | |
| 14 | 8292 | 0.5% | |
| 15 | 8168 | 0.5% | |
| 16 | 8166 | 0.5% | |
| 18 | 7842 | 0.5% | |
| 17 | 7696 | 0.5% | |
| 19 | 7686 | 0.5% | |
| 21 | 7657 | 0.5% | |
| 20 | 7588 | 0.5% | |
| 22 | 7492 | 0.4% | |
| 24 | 7428 | 0.4% | |
| 23 | 7352 | 0.4% | |
| Other values (6274) | 1341587 | 80.3% |
| Value | Count | Frequency (%) | |
| 0 | 109681 | 6.6% | |
| 1 | 15671 | 0.9% | |
| 2 | 13311 | 0.8% | |
| 3 | 11786 | 0.7% | |
| 4 | 10789 | 0.6% | |
| 5 | 10340 | 0.6% | |
| 6 | 10009 | 0.6% | |
| 7 | 9657 | 0.6% | |
| 8 | 9146 | 0.5% | |
| 9 | 9144 | 0.5% |
| Value | Count | Frequency (%) | |
| 28011 | 1 | < 0.1% | |
| 27858 | 1 | < 0.1% | |
| 23765 | 1 | < 0.1% | |
| 23400 | 1 | < 0.1% | |
| 22076 | 1 | < 0.1% | |
| 21776 | 1 | < 0.1% | |
| 20330 | 1 | < 0.1% | |
| 20180 | 1 | < 0.1% | |
| 19914 | 1 | < 0.1% | |
| 19630 | 1 | < 0.1% |
sms
| Distinct count | 7971 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 250.01709580220168 |
| Minimum | 0.0 |
| Maximum | 536352.0 |
| Zeros | 59673 |
| Zeros (%) | 3.6% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 23 |
| median | 72 |
| Q3 | 214 |
| 95-th percentile | 957 |
| Maximum | 536352 |
| Range | 536352 |
| Interquartile range (IQR) | 191 |
Descriptive statistics
| Standard deviation | 1660.24333 |
|---|---|
| Coefficient of variation (CV) | 6.640519218 |
| Kurtosis | 23426.08747 |
| Mean | 250.0170958 |
| Median Absolute Deviation (MAD) | 61 |
| Skewness | 126.3236899 |
| Sum | 417777317 |
| Variance | 2756407.913 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 59673 | 3.6% | |
| 4 | 20474 | 1.2% | |
| 1 | 20261 | 1.2% | |
| 3 | 20235 | 1.2% | |
| 2 | 20225 | 1.2% | |
| 6 | 18518 | 1.1% | |
| 5 | 18202 | 1.1% | |
| 8 | 17694 | 1.1% | |
| 7 | 17153 | 1.0% | |
| 9 | 16955 | 1.0% | |
| 10 | 16699 | 1.0% | |
| 12 | 15918 | 1.0% | |
| 11 | 15732 | 0.9% | |
| 14 | 15113 | 0.9% | |
| 13 | 15070 | 0.9% | |
| 15 | 14517 | 0.9% | |
| 16 | 14297 | 0.9% | |
| 17 | 13926 | 0.8% | |
| 18 | 13898 | 0.8% | |
| 19 | 13345 | 0.8% | |
| 20 | 13240 | 0.8% | |
| 21 | 12959 | 0.8% | |
| 22 | 12942 | 0.8% | |
| 23 | 12289 | 0.7% | |
| 24 | 12242 | 0.7% | |
| Other values (7946) | 1229418 | 73.6% |
| Value | Count | Frequency (%) | |
| 0 | 59673 | 3.6% | |
| 1 | 20261 | 1.2% | |
| 2 | 20225 | 1.2% | |
| 3 | 20235 | 1.2% | |
| 4 | 20474 | 1.2% | |
| 5 | 18202 | 1.1% | |
| 6 | 18518 | 1.1% | |
| 7 | 17153 | 1.0% | |
| 8 | 17694 | 1.1% | |
| 9 | 16955 | 1.0% |
| Value | Count | Frequency (%) | |
| 536352 | 1 | < 0.1% | |
| 441888 | 1 | < 0.1% | |
| 342177 | 1 | < 0.1% | |
| 340597 | 1 | < 0.1% | |
| 337645 | 1 | < 0.1% | |
| 324460 | 1 | < 0.1% | |
| 315062 | 1 | < 0.1% | |
| 313807 | 1 | < 0.1% | |
| 302950 | 1 | < 0.1% | |
| 280995 | 1 | < 0.1% |
revenue
Real number (ℝ≥0)
| Distinct count | 8109 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1279.4329318394728 |
| Minimum | 5.4433 |
| Maximum | 16496.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 5.4433 |
|---|---|
| 5-th percentile | 499 |
| Q1 | 799 |
| median | 999 |
| Q3 | 1798 |
| 95-th percentile | 2499 |
| Maximum | 16496 |
| Range | 16490.5567 |
| Interquartile range (IQR) | 999 |
Descriptive statistics
| Standard deviation | 752.1332333 |
|---|---|
| Coefficient of variation (CV) | 0.5878645255 |
| Kurtosis | 9.589338837 |
| Mean | 1279.432932 |
| Median Absolute Deviation (MAD) | 400 |
| Skewness | 2.13414743 |
| Sum | 2137926032 |
| Variance | 565704.4007 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 999 | 284357 | 17.0% | |
| 599 | 228612 | 13.7% | |
| 799 | 207093 | 12.4% | |
| 1799 | 199957 | 12.0% | |
| 1499 | 192466 | 11.5% | |
| 1299 | 90862 | 5.4% | |
| 2499 | 57624 | 3.4% | |
| 1999 | 33433 | 2.0% | |
| 499 | 33007 | 2.0% | |
| 399 | 25942 | 1.6% | |
| 299 | 18970 | 1.1% | |
| 3799 | 15563 | 0.9% | |
| 2198 | 13473 | 0.8% | |
| 500 | 11922 | 0.7% | |
| 2898 | 7623 | 0.5% | |
| 1598 | 7491 | 0.4% | |
| 1678 | 7153 | 0.4% | |
| 1178 | 7092 | 0.4% | |
| 4198 | 7010 | 0.4% | |
| 332.75 | 6741 | 0.4% | |
| 1928 | 6684 | 0.4% | |
| 50 | 6392 | 0.4% | |
| 888 | 6107 | 0.4% | |
| 1898 | 5567 | 0.3% | |
| 1128 | 5043 | 0.3% | |
| Other values (8084) | 184811 | 11.1% |
| Value | Count | Frequency (%) | |
| 5.4433 | 1 | < 0.1% | |
| 5.4532 | 1 | < 0.1% | |
| 7.6781 | 1 | < 0.1% | |
| 8.6707 | 4 | < 0.1% | |
| 8.6757 | 1 | < 0.1% | |
| 8.6787 | 2 | < 0.1% | |
| 9.6416 | 2 | < 0.1% | |
| 11.9006 | 3 | < 0.1% | |
| 11.9007 | 3 | < 0.1% | |
| 11.9064 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 16496 | 1 | < 0.1% | |
| 15998.01 | 3 | < 0.1% | |
| 15998 | 2 | < 0.1% | |
| 13417.6719 | 1 | < 0.1% | |
| 12643.5745 | 2 | < 0.1% | |
| 12478.64 | 1 | < 0.1% | |
| 11293 | 1 | < 0.1% | |
| 10998 | 1 | < 0.1% | |
| 10837.3539 | 1 | < 0.1% | |
| 10720.5817 | 1 | < 0.1% |
brand
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.7 MiB |
| Value | Count | Frequency (%) | |
| Globe Postpaid | 1670995 | 100.0% |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 3341990 | 14.3% | |
| G | 1670995 | 7.1% | |
| l | 1670995 | 7.1% | |
| b | 1670995 | 7.1% | |
| e | 1670995 | 7.1% | |
| (space) | 1670995 | 7.1% | |
| P | 1670995 | 7.1% | |
| s | 1670995 | 7.1% | |
| t | 1670995 | 7.1% | |
| p | 1670995 | 7.1% | |
| a | 1670995 | 7.1% | |
| i | 1670995 | 7.1% | |
| d | 1670995 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 18380945 | 78.6% | |
| Uppercase Letter | 3341990 | 14.3% | |
| Space Separator | 1670995 | 7.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| G | 1670995 | 50.0% | |
| P | 1670995 | 50.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 3341990 | 18.2% | |
| l | 1670995 | 9.1% | |
| b | 1670995 | 9.1% | |
| e | 1670995 | 9.1% | |
| s | 1670995 | 9.1% | |
| t | 1670995 | 9.1% | |
| p | 1670995 | 9.1% | |
| a | 1670995 | 9.1% | |
| i | 1670995 | 9.1% | |
| d | 1670995 | 9.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 1670995 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 21722935 | 92.9% | |
| Common | 1670995 | 7.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 3341990 | 15.4% | |
| G | 1670995 | 7.7% | |
| l | 1670995 | 7.7% | |
| b | 1670995 | 7.7% | |
| e | 1670995 | 7.7% | |
| P | 1670995 | 7.7% | |
| s | 1670995 | 7.7% | |
| t | 1670995 | 7.7% | |
| p | 1670995 | 7.7% | |
| a | 1670995 | 7.7% | |
| i | 1670995 | 7.7% | |
| d | 1670995 | 7.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 1670995 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 23393930 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 3341990 | 14.3% | |
| G | 1670995 | 7.1% | |
| l | 1670995 | 7.1% | |
| b | 1670995 | 7.1% | |
| e | 1670995 | 7.1% | |
| (space) | 1670995 | 7.1% | |
| P | 1670995 | 7.1% | |
| s | 1670995 | 7.1% | |
| t | 1670995 | 7.1% | |
| p | 1670995 | 7.1% | |
| a | 1670995 | 7.1% | |
| i | 1670995 | 7.1% | |
| d | 1670995 | 7.1% |
clusterId
| Distinct count | 149 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.7 MiB |
| Value | Count | Frequency (%) | |
| nonOutlier_197 | 56126 | 3.4% | |
| nonOutlier_255 | 38374 | 2.3% | |
| nonOutlier_209 | 35507 | 2.1% | |
| nonOutlier_297 | 30716 | 1.8% | |
| nonOutlier_161 | 27623 | 1.7% | |
| nonOutlier_85 | 25166 | 1.5% | |
| nonOutlier_20 | 20007 | 1.2% | |
| nonOutlier_53 | 19869 | 1.2% | |
| nonOutlier_185 | 18918 | 1.1% | |
| nonOutlier_108 | 18391 | 1.1% | |
| nonOutlier_205 | 18354 | 1.1% | |
| nonOutlier_62 | 17113 | 1.0% | |
| nonOutlier_158 | 16945 | 1.0% | |
| nonOutlier_191 | 16894 | 1.0% | |
| nonOutlier_30 | 16437 | 1.0% | |
| nonOutlier_199 | 16431 | 1.0% | |
| outlier_205.15 | 16193 | 1.0% | |
| nonOutlier_39 | 15896 | 1.0% | |
| nonOutlier_160 | 15706 | 0.9% | |
| nonOutlier_129 | 15629 | 0.9% | |
| nonOutlier_68 | 15421 | 0.9% | |
| nonOutlier_169 | 15269 | 0.9% | |
| nonOutlier_192 | 15177 | 0.9% | |
| nonOutlier_145 | 14692 | 0.9% | |
| nonOutlier_289 | 14669 | 0.9% | |
| Other values (124) | 1139472 | 68.2% |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 13.95216503 |
| Min length | 12 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 3023088 | 13.0% | |
| o | 1670995 | 7.2% | |
| u | 1670995 | 7.2% | |
| t | 1670995 | 7.2% | |
| l | 1670995 | 7.2% | |
| i | 1670995 | 7.2% | |
| e | 1670995 | 7.2% | |
| r | 1670995 | 7.2% | |
| _ | 1670995 | 7.2% | |
| O | 1511544 | 6.5% | |
| 1 | 1157862 | 5.0% | |
| 2 | 939905 | 4.0% | |
| 5 | 592796 | 2.5% | |
| 0 | 512356 | 2.2% | |
| . | 372341 | 1.6% | |
| 8 | 370198 | 1.6% | |
| 9 | 351140 | 1.5% | |
| 7 | 336100 | 1.4% | |
| 3 | 297333 | 1.3% | |
| 6 | 271072 | 1.2% | |
| 4 | 210303 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 14720053 | 63.1% | |
| Decimal Number | 5039065 | 21.6% | |
| Connector Punctuation | 1670995 | 7.2% | |
| Uppercase Letter | 1511544 | 6.5% | |
| Other Punctuation | 372341 | 1.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 3023088 | 20.5% | |
| o | 1670995 | 11.4% | |
| u | 1670995 | 11.4% | |
| t | 1670995 | 11.4% | |
| l | 1670995 | 11.4% | |
| i | 1670995 | 11.4% | |
| e | 1670995 | 11.4% | |
| r | 1670995 | 11.4% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| O | 1511544 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 1670995 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 1157862 | 23.0% | |
| 2 | 939905 | 18.7% | |
| 5 | 592796 | 11.8% | |
| 0 | 512356 | 10.2% | |
| 8 | 370198 | 7.3% | |
| 9 | 351140 | 7.0% | |
| 7 | 336100 | 6.7% | |
| 3 | 297333 | 5.9% | |
| 6 | 271072 | 5.4% | |
| 4 | 210303 | 4.2% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 372341 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 16231597 | 69.6% | |
| Common | 7082401 | 30.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 3023088 | 18.6% | |
| o | 1670995 | 10.3% | |
| u | 1670995 | 10.3% | |
| t | 1670995 | 10.3% | |
| l | 1670995 | 10.3% | |
| i | 1670995 | 10.3% | |
| e | 1670995 | 10.3% | |
| r | 1670995 | 10.3% | |
| O | 1511544 | 9.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| _ | 1670995 | 23.6% | |
| 1 | 1157862 | 16.3% | |
| 2 | 939905 | 13.3% | |
| 5 | 592796 | 8.4% | |
| 0 | 512356 | 7.2% | |
| . | 372341 | 5.3% | |
| 8 | 370198 | 5.2% | |
| 9 | 351140 | 5.0% | |
| 7 | 336100 | 4.7% | |
| 3 | 297333 | 4.2% | |
| 6 | 271072 | 3.8% | |
| 4 | 210303 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 23313998 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 3023088 | 13.0% | |
| o | 1670995 | 7.2% | |
| u | 1670995 | 7.2% | |
| t | 1670995 | 7.2% | |
| l | 1670995 | 7.2% | |
| i | 1670995 | 7.2% | |
| e | 1670995 | 7.2% | |
| r | 1670995 | 7.2% | |
| _ | 1670995 | 7.2% | |
| O | 1511544 | 6.5% | |
| 1 | 1157862 | 5.0% | |
| 2 | 939905 | 4.0% | |
| 5 | 592796 | 2.5% | |
| 0 | 512356 | 2.2% | |
| . | 372341 | 1.6% | |
| 8 | 370198 | 1.6% | |
| 9 | 351140 | 1.5% | |
| 7 | 336100 | 1.4% | |
| 3 | 297333 | 1.3% | |
| 6 | 271072 | 1.2% | |
| 4 | 210303 | 0.9% |
dataRevenuePredicted
| Distinct count | 1448178 |
|---|---|
| Unique (%) | 86.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 550.3867643952545 |
| Minimum | 0.0 |
| Maximum | 23707.17016910278 |
| Zeros | 219596 |
| Zeros (%) | 13.1% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 11.9288669 |
| median | 225.2723017 |
| Q3 | 969.2987249 |
| 95-th percentile | 1827.736836 |
| Maximum | 23707.17017 |
| Range | 23707.17017 |
| Interquartile range (IQR) | 957.369858 |
Descriptive statistics
| Standard deviation | 666.0653108 |
|---|---|
| Coefficient of variation (CV) | 1.210176832 |
| Kurtosis | 7.784348364 |
| Mean | 550.3867644 |
| Median Absolute Deviation (MAD) | 225.2723017 |
| Skewness | 1.610628698 |
| Sum | 919693531.4 |
| Variance | 443642.9983 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 219596 | 13.1% | |
| 5.101499129e-06 | 55 | < 0.1% | |
| 5.793866844e-06 | 47 | < 0.1% | |
| 1.65990438e-06 | 34 | < 0.1% | |
| 0.0002877980035 | 31 | < 0.1% | |
| 1.079804646e-05 | 30 | < 0.1% | |
| 8.579106278e-06 | 28 | < 0.1% | |
| 9.198047422e-06 | 24 | < 0.1% | |
| 2.896933422e-06 | 24 | < 0.1% | |
| 3.875707352e-06 | 24 | < 0.1% | |
| 8.299521901e-06 | 23 | < 0.1% | |
| 7.751414703e-06 | 23 | < 0.1% | |
| 2.550749564e-06 | 21 | < 0.1% | |
| 0.0004316970053 | 21 | < 0.1% | |
| 3.319808761e-06 | 21 | < 0.1% | |
| 0.0005755960071 | 21 | < 0.1% | |
| 0.0001438990018 | 20 | < 0.1% | |
| 1.682338549e-05 | 19 | < 0.1% | |
| 2.255225223e-05 | 17 | < 0.1% | |
| 1.161933066e-05 | 16 | < 0.1% | |
| 0.0007194950089 | 16 | < 0.1% | |
| 9.959426282e-06 | 15 | < 0.1% | |
| 0.0008633940106 | 15 | < 0.1% | |
| 4.979713141e-06 | 15 | < 0.1% | |
| 1.127612611e-05 | 15 | < 0.1% | |
| Other values (1448153) | 1450824 | 86.8% |
| Value | Count | Frequency (%) | |
| 0 | 219596 | 13.1% | |
| 5.678980783e-08 | 1 | < 0.1% | |
| 1.03237192e-07 | 1 | < 0.1% | |
| 1.054433649e-07 | 1 | < 0.1% | |
| 1.069784889e-07 | 1 | < 0.1% | |
| 1.079303859e-07 | 1 | < 0.1% | |
| 1.188604592e-07 | 1 | < 0.1% | |
| 1.19553824e-07 | 1 | < 0.1% | |
| 1.360401498e-07 | 1 | < 0.1% | |
| 1.414507668e-07 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 23707.17017 | 1 | < 0.1% | |
| 20535.52964 | 1 | < 0.1% | |
| 17426.40902 | 1 | < 0.1% | |
| 15586.53647 | 1 | < 0.1% | |
| 15560.79331 | 1 | < 0.1% | |
| 14677.39432 | 1 | < 0.1% | |
| 14606.3031 | 1 | < 0.1% | |
| 14418.29102 | 1 | < 0.1% | |
| 14289.83402 | 1 | < 0.1% | |
| 14120.58677 | 1 | < 0.1% |
voiceRevenuePredicted
| Distinct count | 52394 |
|---|---|
| Unique (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 418.36403773044697 |
| Minimum | 0.0 |
| Maximum | 14947.934872527232 |
| Zeros | 222363 |
| Zeros (%) | 13.3% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 23.09784958 |
| median | 211.7424847 |
| Q3 | 649.944579 |
| 95-th percentile | 1412.76486 |
| Maximum | 14947.93487 |
| Range | 14947.93487 |
| Interquartile range (IQR) | 626.8467295 |
Descriptive statistics
| Standard deviation | 557.6118981 |
|---|---|
| Coefficient of variation (CV) | 1.332838982 |
| Kurtosis | 25.09425328 |
| Mean | 418.3640377 |
| Median Absolute Deviation (MAD) | 211.7424847 |
| Skewness | 3.296263002 |
| Sum | 699084215.2 |
| Variance | 310931.0289 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 222363 | 13.3% | |
| 44.35234549 | 811 | < 0.1% | |
| 88.70469098 | 598 | < 0.1% | |
| 4.802503189 | 500 | < 0.1% | |
| 133.0570365 | 478 | < 0.1% | |
| 1.93199149 | 468 | < 0.1% | |
| 15.73754913 | 453 | < 0.1% | |
| 9.605006377 | 430 | < 0.1% | |
| 39.53575462 | 430 | < 0.1% | |
| 3.86398298 | 425 | < 0.1% | |
| 12.54556965 | 414 | < 0.1% | |
| 236.063237 | 412 | < 0.1% | |
| 0.9478114819 | 408 | < 0.1% | |
| 221.7617275 | 383 | < 0.1% | |
| 31.47509826 | 380 | < 0.1% | |
| 177.409382 | 379 | < 0.1% | |
| 283.2758844 | 376 | < 0.1% | |
| 14.40750957 | 374 | < 0.1% | |
| 13.63694921 | 373 | < 0.1% | |
| 8.917066885 | 366 | < 0.1% | |
| 5.795974471 | 366 | < 0.1% | |
| 299.0134335 | 365 | < 0.1% | |
| 267.5383352 | 358 | < 0.1% | |
| 251.8007861 | 356 | < 0.1% | |
| 7.727965961 | 351 | < 0.1% | |
| Other values (52369) | 1438378 | 86.1% |
| Value | Count | Frequency (%) | |
| 0 | 222363 | 13.3% | |
| 0.06859984581 | 141 | < 0.1% | |
| 0.1278676012 | 163 | < 0.1% | |
| 0.1371996916 | 147 | < 0.1% | |
| 0.1712812279 | 76 | < 0.1% | |
| 0.1735838188 | 154 | < 0.1% | |
| 0.1929535719 | 159 | < 0.1% | |
| 0.1951436176 | 5 | < 0.1% | |
| 0.2057995374 | 114 | < 0.1% | |
| 0.2557352024 | 163 | < 0.1% |
| Value | Count | Frequency (%) | |
| 14947.93487 | 1 | < 0.1% | |
| 14816.04133 | 1 | < 0.1% | |
| 13020.82366 | 1 | < 0.1% | |
| 12954.87689 | 1 | < 0.1% | |
| 12947.54947 | 1 | < 0.1% | |
| 12885.26641 | 1 | < 0.1% | |
| 12808.32851 | 1 | < 0.1% | |
| 12793.67367 | 1 | < 0.1% | |
| 12782.68254 | 1 | < 0.1% | |
| 12746.04545 | 1 | < 0.1% |
smsRevenuePredicted
| Distinct count | 61312 |
|---|---|
| Unique (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 228.1703653534256 |
| Minimum | 0.0 |
| Maximum | 22928.001073226926 |
| Zeros | 296058 |
| Zeros (%) | 17.7% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4.945962975 |
| median | 46.27551601 |
| Q3 | 296.8329611 |
| 95-th percentile | 1019.085424 |
| Maximum | 22928.00107 |
| Range | 22928.00107 |
| Interquartile range (IQR) | 291.8869982 |
Descriptive statistics
| Standard deviation | 374.9456973 |
|---|---|
| Coefficient of variation (CV) | 1.643270793 |
| Kurtosis | 26.76355773 |
| Mean | 228.1703654 |
| Median Absolute Deviation (MAD) | 46.27551601 |
| Skewness | 2.774820483 |
| Sum | 381271539.7 |
| Variance | 140584.2759 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 296058 | 17.7% | |
| 11.39302417 | 797 | < 0.1% | |
| 36.87243716 | 751 | < 0.1% | |
| 22.78604833 | 675 | < 0.1% | |
| 3.485800936 | 632 | < 0.1% | |
| 45.57209666 | 631 | < 0.1% | |
| 73.74487431 | 629 | < 0.1% | |
| 110.6173115 | 616 | < 0.1% | |
| 34.1790725 | 614 | < 0.1% | |
| 10.45740281 | 600 | < 0.1% | |
| 147.4897486 | 599 | < 0.1% | |
| 6.971601872 | 582 | < 0.1% | |
| 4.146788403 | 558 | < 0.1% | |
| 24.66709525 | 545 | < 0.1% | |
| 16.58715361 | 534 | < 0.1% | |
| 13.94320374 | 533 | < 0.1% | |
| 12.44036521 | 532 | < 0.1% | |
| 8.293576807 | 523 | < 0.1% | |
| 3.33051803 | 493 | < 0.1% | |
| 49.68866596 | 491 | < 0.1% | |
| 56.96512083 | 481 | < 0.1% | |
| 15.27311281 | 470 | < 0.1% | |
| 17.42900468 | 469 | < 0.1% | |
| 68.358145 | 465 | < 0.1% | |
| 20.73394202 | 456 | < 0.1% | |
| Other values (61287) | 1361261 | 81.5% |
| Value | Count | Frequency (%) | |
| 0 | 296058 | 17.7% | |
| 0.0003899035586 | 25 | < 0.1% | |
| 0.0007798071173 | 38 | < 0.1% | |
| 0.001169710676 | 36 | < 0.1% | |
| 0.001559614235 | 48 | < 0.1% | |
| 0.001949517793 | 55 | < 0.1% | |
| 0.002339421352 | 47 | < 0.1% | |
| 0.002729324911 | 57 | < 0.1% | |
| 0.003119228469 | 55 | < 0.1% | |
| 0.003509132028 | 46 | < 0.1% |
| Value | Count | Frequency (%) | |
| 22928.00107 | 1 | < 0.1% | |
| 21772.07376 | 1 | < 0.1% | |
| 16549.91674 | 1 | < 0.1% | |
| 14500.75102 | 1 | < 0.1% | |
| 14387.06782 | 1 | < 0.1% | |
| 10486.57191 | 1 | < 0.1% | |
| 7911.676202 | 1 | < 0.1% | |
| 7495.210396 | 1 | < 0.1% | |
| 6715.246259 | 1 | < 0.1% | |
| 6676.018071 | 1 | < 0.1% |
dataRevenue
| Distinct count | 1371360 |
|---|---|
| Unique (%) | 82.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 583.9378945965124 |
| Minimum | 0.0 |
| Maximum | 12643.5745 |
| Zeros | 219596 |
| Zeros (%) | 13.1% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 14.11029907 |
| median | 262.4046133 |
| Q3 | 999 |
| 95-th percentile | 1816.452797 |
| Maximum | 12643.5745 |
| Range | 12643.5745 |
| Interquartile range (IQR) | 984.8897009 |
Descriptive statistics
| Standard deviation | 689.0691117 |
|---|---|
| Coefficient of variation (CV) | 1.180038353 |
| Kurtosis | 2.394431278 |
| Mean | 583.9378946 |
| Median Absolute Deviation (MAD) | 262.4046133 |
| Skewness | 1.338096967 |
| Sum | 975757302.2 |
| Variance | 474816.2407 |
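The derived rows in the two tables above can be cross-checked from the primary ones. The following is a minimal sketch, assuming the usual definitions (IQR = Q3 − Q1, range = maximum − minimum, CV = standard deviation / mean, variance = standard deviation squared); the variable names are illustrative and the inputs are copied from the tables.

```python
# Primary statistics copied from the quantile and descriptive tables above.
q1 = 14.11029907
q3 = 999.0
minimum = 0.0
maximum = 12643.5745
mean = 583.9378946
std = 689.0691117

# Derived statistics, recomputed from the primary ones.
iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum
cv = std / mean                # coefficient of variation
variance = std ** 2

print(f"IQR={iqr:.7f} range={value_range:.4f} CV={cv:.9f} variance={variance:.4f}")
```

Small differences in the last printed digit are expected, because the report shows rounded inputs.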
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 219596 | 13.1% | |
| 999 | 27027 | 1.6% | |
| 599 | 13928 | 0.8% | |
| 799 | 10497 | 0.6% | |
| 1499 | 6389 | 0.4% | |
| 1299 | 4305 | 0.3% | |
| 499 | 2092 | 0.1% | |
| 1799 | 1817 | 0.1% | |
| 299 | 1714 | 0.1% | |
| 50 | 1350 | 0.1% | |
| 2499 | 691 | < 0.1% | |
| 1128 | 584 | < 0.1% | |
| 399 | 500 | < 0.1% | |
| 3799 | 469 | < 0.1% | |
| 1999 | 463 | < 0.1% | |
| 598 | 462 | < 0.1% | |
| 1098 | 371 | < 0.1% | |
| 300 | 347 | < 0.1% | |
| 1178 | 319 | < 0.1% | |
| 500 | 277 | < 0.1% | |
| 898 | 240 | < 0.1% | |
| 928 | 191 | < 0.1% | |
| 1598 | 178 | < 0.1% | |
| 888 | 167 | < 0.1% | |
| 1298 | 164 | < 0.1% | |
| Other values (1371335) | 1376857 | 82.4% |
| Value | Count | Frequency (%) | |
| 0 | 219596 | 13.1% | |
| 4.033344825e-08 | 1 | < 0.1% | |
| 8.365554269e-08 | 1 | < 0.1% | |
| 8.891543244e-08 | 1 | < 0.1% | |
| 9.538970373e-08 | 1 | < 0.1% | |
| 1.008753072e-07 | 1 | < 0.1% | |
| 1.050391029e-07 | 1 | < 0.1% | |
| 1.159086231e-07 | 1 | < 0.1% | |
| 1.194923765e-07 | 1 | < 0.1% | |
| 1.242036327e-07 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 12643.5745 | 2 | < 0.1% | |
| 10204.84618 | 1 | < 0.1% | |
| 9352.9687 | 1 | < 0.1% | |
| 8533.623012 | 1 | < 0.1% | |
| 8460.598416 | 1 | < 0.1% | |
| 8398 | 8 | < 0.1% | |
| 8239.745905 | 1 | < 0.1% | |
| 8128 | 1 | < 0.1% | |
| 8079.917043 | 1 | < 0.1% | |
| 8062.9061 | 1 | < 0.1% |
| Distinct count | 1416187 |
|---|---|
| Unique (%) | 84.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 447.4594534052984 |
| Minimum | 0.0 |
| Maximum | 15574.880911502554 |
| Zeros | 222363 |
| Zeros (%) | 13.3% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 24.43840241 |
| median | 235.317158 |
| Q3 | 671.9352854 |
| 95-th percentile | 1485.452624 |
| Maximum | 15574.88091 |
| Range | 15574.88091 |
| Interquartile range (IQR) | 647.496883 |
Descriptive statistics
| Standard deviation | 597.935246 |
|---|---|
| Coefficient of variation (CV) | 1.336289225 |
| Kurtosis | 16.34980047 |
| Mean | 447.4594534 |
| Median Absolute Deviation (MAD) | 234.6116059 |
| Skewness | 2.987280169 |
| Sum | 747702509.3 |
| Variance | 357526.5584 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 222363 | 13.3% | |
| 999 | 8244 | 0.5% | |
| 599 | 1973 | 0.1% | |
| 799 | 1370 | 0.1% | |
| 399 | 601 | < 0.1% | |
| 499 | 461 | < 0.1% | |
| 299 | 307 | < 0.1% | |
| 332.75 | 239 | < 0.1% | |
| 1178 | 222 | < 0.1% | |
| 500 | 133 | < 0.1% | |
| 1098 | 127 | < 0.1% | |
| 1799 | 122 | < 0.1% | |
| 300 | 92 | < 0.1% | |
| 1128 | 79 | < 0.1% | |
| 50 | 74 | < 0.1% | |
| 1499 | 74 | < 0.1% | |
| 1198 | 73 | < 0.1% | |
| 1088 | 64 | < 0.1% | |
| 898 | 61 | < 0.1% | |
| 133.1 | 60 | < 0.1% | |
| 1299 | 53 | < 0.1% | |
| 888 | 47 | < 0.1% | |
| 347.7537836 | 45 | < 0.1% | |
| 688 | 40 | < 0.1% | |
| 384.9213303 | 38 | < 0.1% | |
| Other values (1416162) | 1434033 | 85.8% |
| Value | Count | Frequency (%) | |
| 0 | 222363 | 13.3% | |
| 0.01261880484 | 1 | < 0.1% | |
| 0.02131756006 | 1 | < 0.1% | |
| 0.03791272075 | 1 | < 0.1% | |
| 0.05452517886 | 1 | < 0.1% | |
| 0.05481075236 | 1 | < 0.1% | |
| 0.05507232511 | 1 | < 0.1% | |
| 0.05574109386 | 1 | < 0.1% | |
| 0.05594937173 | 1 | < 0.1% | |
| 0.05599623645 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 15574.88091 | 1 | < 0.1% | |
| 15207.38876 | 1 | < 0.1% | |
| 14602.41745 | 1 | < 0.1% | |
| 13268.42166 | 1 | < 0.1% | |
| 12279.3486 | 1 | < 0.1% | |
| 10181.25513 | 1 | < 0.1% | |
| 10002.699 | 1 | < 0.1% | |
| 9743.077349 | 1 | < 0.1% | |
| 9738.451532 | 1 | < 0.1% | |
| 9498.4933 | 1 | < 0.1% |
| Distinct count | 1316006 |
|---|---|
| Unique (%) | 78.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 248.0355838376625 |
| Minimum | 0.0 |
| Maximum | 9566.834161612367 |
| Zeros | 296058 |
| Zeros (%) | 17.7% |
| Memory size | 12.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5.213030217 |
| median | 49.63484142 |
| Q3 | 332.75 |
| 95-th percentile | 1114.186221 |
| Maximum | 9566.834162 |
| Range | 9566.834162 |
| Interquartile range (IQR) | 327.5369698 |
Descriptive statistics
| Standard deviation | 395.1969905 |
|---|---|
| Coefficient of variation (CV) | 1.593307639 |
| Kurtosis | 6.877999462 |
| Mean | 248.0355838 |
| Median Absolute Deviation (MAD) | 49.63484142 |
| Skewness | 2.272678728 |
| Sum | 414466220.4 |
| Variance | 156180.6613 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 296058 | 17.7% | |
| 599 | 8731 | 0.5% | |
| 799 | 5863 | 0.4% | |
| 999 | 5431 | 0.3% | |
| 1499 | 4339 | 0.3% | |
| 1799 | 2592 | 0.2% | |
| 299 | 1890 | 0.1% | |
| 1299 | 1829 | 0.1% | |
| 499 | 1642 | 0.1% | |
| 399 | 1280 | 0.1% | |
| 50 | 941 | 0.1% | |
| 332.75 | 916 | 0.1% | |
| 500 | 592 | < 0.1% | |
| 2499 | 588 | < 0.1% | |
| 133.1 | 376 | < 0.1% | |
| 888 | 338 | < 0.1% | |
| 1999 | 334 | < 0.1% | |
| 1178 | 272 | < 0.1% | |
| 1678 | 247 | < 0.1% | |
| 688 | 219 | < 0.1% | |
| 698 | 205 | < 0.1% | |
| 898 | 172 | < 0.1% | |
| 349 | 137 | < 0.1% | |
| 1098 | 132 | < 0.1% | |
| 2198 | 125 | < 0.1% | |
| Other values (1315981) | 1335746 | 79.9% |
| Value | Count | Frequency (%) | |
| 0 | 296058 | 17.7% | |
| 0.0002530191929 | 1 | < 0.1% | |
| 0.0003030788662 | 1 | < 0.1% | |
| 0.000356185438 | 1 | < 0.1% | |
| 0.0003610480349 | 1 | < 0.1% | |
| 0.0003629059139 | 1 | < 0.1% | |
| 0.0003757977677 | 1 | < 0.1% | |
| 0.0003819027817 | 2 | < 0.1% | |
| 0.0003896392971 | 1 | < 0.1% | |
| 0.0003903581883 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9566.834162 | 1 | < 0.1% | |
| 8398 | 1 | < 0.1% | |
| 8195.285992 | 1 | < 0.1% | |
| 7981.05418 | 1 | < 0.1% | |
| 7863.130512 | 1 | < 0.1% | |
| 7224.819179 | 1 | < 0.1% | |
| 7016.064125 | 1 | < 0.1% | |
| 6976.6726 | 1 | < 0.1% | |
| 6247.599234 | 1 | < 0.1% | |
| 5950.943302 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a linear relationship the steepness of the slope does not affect r; only its sign does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
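As an illustration of this definition, here is a minimal pure-Python sketch (not the code the profiling tool itself runs). The helper name `pearson_r` is hypothetical, and the inputs are the `dataRevenuePredicted` and `dataRevenue` values from the ten "First rows" of this report, so the result describes only that small sample.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and of the two standard deviations
    # cancel, so plain sums of centered products are sufficient.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)


# Sample values taken from the "First rows" table of this report.
predicted = [1286.391797, 222.616901, 146.483612, 1417.089260, 147.551389,
             1558.531222, 0.0, 56.367789, 731.073833, 0.0]
actual = [1491.613240, 433.228674, 120.405687, 1133.751277, 211.224467,
          1712.965630, 0.0, 72.126759, 474.436312, 0.0]

r = pearson_r(predicted, actual)
# Shifting and rescaling one variable by a positive factor leaves r unchanged.
r_rescaled = pearson_r([2.0 * v + 3.0 for v in predicted], actual)
print(r, r_rescaled)
```

The second call demonstrates the location and scale invariance: both printed values agree up to floating-point error.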
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
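The rank-based calculation can be sketched in pure Python as follows. This is an illustrative sketch, not the tool's own implementation: the helper names are hypothetical, and tied values are assumed to receive the average of the ranks they span, which is a common convention.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)


def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A nonlinear but strictly increasing relationship: y = x ** 3.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```

For the cubic example the ranks of both variables are identical, so ρ reaches its maximum, while Pearson's r stays below it because the relationship is not linear.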
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations; τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
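The pair-counting definition can be sketched directly. The function below is a hypothetical helper that implements exactly the formula stated above (often called tau-a); library implementations frequently use a tie-adjusted variant instead, so their results may differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs, as in the definition above."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The sign of the product tells whether both variables move the same way.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y; it counts as neither.
    return (concordant - discordant) / len(pairs)


# Three observations: two concordant pairs and one discordant pair.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau)
```

This direct version compares every pair and therefore scales quadratically with the number of observations, which is fine for illustration but slow for a dataset of this size.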
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for Phik.
First rows
| | subsId | data | voice | sms | revenue | brand | clusterId | dataRevenuePredicted | voiceRevenuePredicted | smsRevenuePredicted | dataRevenue | voiceRevenue | smsRevenue |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | J41EATby/kOcD+hsMASxhp2d+JLRj2Rl | 6317.891335 | 27.0 | 7.0 | 1499.0 | Globe Postpaid | nonOutlier_127 | 1286.391797 | 4.686763 | 1.683700 | 1491.613240 | 5.434455 | 1.952305 |
| 1 | J41EATby/kOcD5NtMAT8nZqy3rzRj2Rl | 1947.336953 | 23.0 | 70.0 | 1078.0 | Globe Postpaid | nonOutlier_192 | 222.616901 | 41.587835 | 289.731403 | 433.228674 | 80.932950 | 563.838375 |
| 2 | J41EATby/kOcD+hsPk6LppqI3pqauWRl | 5221.555423 | 549.0 | 88.0 | 599.0 | Globe Postpaid | nonOutlier_205 | 146.483612 | 576.906560 | 5.343544 | 120.405687 | 474.202060 | 4.392253 |
| 3 | J41EATby/kOcD5NsNASLnZq3/+7Qn2Rl | 7572.696437 | 228.0 | 33.0 | 1799.0 | Globe Postpaid | nonOutlier_12 | 1417.089260 | 821.881365 | 9.620958 | 1133.751277 | 657.551414 | 7.697309 |
| 4 | J41EATby/kOcAM5qMAKTkZqz7JLRn2Rl | 1453.581084 | 183.0 | 33.0 | 1977.0 | Globe Postpaid | nonOutlier_11 | 147.551389 | 1133.509674 | 99.977322 | 211.224467 | 1622.654844 | 143.120689 |
| 5 | J41EATby/kOcD5NsNASLo52d+JbQuWRl | 23480.104545 | 163.0 | 77.0 | 1799.0 | Globe Postpaid | nonOutlier_269 | 1558.531222 | 59.903914 | 18.373927 | 1712.965630 | 65.839776 | 20.194594 |
| 6 | J41EATby/kOcD+hqNDuPnZqJ7JrQuWRl | 0.000000 | 0.0 | 74.0 | 999.0 | Globe Postpaid | nonOutlier_143 | 0.000000 | 0.000000 | 750.041299 | 0.000000 | 0.000000 | 999.000000 |
| 7 | J41EATby/kOcAM5sMkqLlJ2d+JrRj2Rl | 1945.774417 | 432.0 | 470.0 | 999.0 | Globe Postpaid | nonOutlier_209 | 56.367789 | 288.085401 | 436.275380 | 72.126759 | 368.626596 | 558.246644 |
| 8 | J41EATby/kOcD+BsNASxhp2Z7LLQj2Rl | 12504.552249 | 605.0 | 216.0 | 799.0 | Globe Postpaid | nonOutlier_161 | 731.073833 | 357.694142 | 142.436250 | 474.436312 | 232.128524 | 92.435163 |
| 9 | J41EATby/kOcD5NqMAKDkZqL3rzRj2Rl | 0.000000 | 155.0 | 45.0 | 799.0 | Globe Postpaid | nonOutlier_208 | 0.000000 | 701.279785 | 44.726868 | 0.000000 | 751.095913 | 47.904087 |
Last rows
| | subsId | data | voice | sms | revenue | brand | clusterId | dataRevenuePredicted | voiceRevenuePredicted | smsRevenuePredicted | dataRevenue | voiceRevenue | smsRevenue |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1670985 | J41EATby/kOcD+hsMAehhprA+JLAuWRl | 4386.555650 | 1658.0 | 8807.0 | 1799.0000 | Globe Postpaid | outlier_205.8 | 58.628366 | 1217.277307 | 479.250508 | 60.092903 | 1247.684907 | 491.222190 |
| 1670986 | J41EATby/kOcRuBqNASLg5rA+I6auWRl | 0.032700 | 8.0 | 1282.0 | 599.0000 | Globe Postpaid | outlier_205.12.2 | 0.000579 | 14.135512 | 106.050813 | 0.002885 | 70.450035 | 528.547080 |
| 1670987 | J41EATby/kOcD+htMAT8gJ2Z7JLQj2Rl | 56957.557020 | 31.0 | 31.0 | 1499.0000 | Globe Postpaid | outlier_205.7.0 | 1243.856496 | 23.431362 | 7.091680 | 1463.097007 | 27.561344 | 8.341650 |
| 1670988 | J41EATby/kOcRuBtMkr8ppqI3pbRj2Rl | 43716.364295 | 892.0 | 143.0 | 3799.0000 | Globe Postpaid | outlier_205.18.1 | 1623.541121 | 3605.512498 | 1.009232 | 1179.303747 | 2618.963169 | 0.733084 |
| 1670989 | J41EATby/kOcRuBqMAKTppq13rzRj2Rl | 1957.039154 | 779.0 | 1777.0 | 799.0000 | Globe Postpaid | outlier_205.17 | 11.530992 | 638.952231 | 6.467238 | 14.024288 | 777.110091 | 7.865621 |
| 1670990 | J41EATby/kOcAM5sNASLhpq3+I7QqWRl | 3549.967787 | 563.0 | 152.0 | 5398.0000 | Globe Postpaid | outlier_205.1 | 268.270520 | 2062.668464 | 9.937130 | 618.624906 | 4756.460330 | 22.914765 |
| 1670991 | J41EATby/kOcD+htMAehhJ2Z7LLQj2Rl | 154740.937429 | 0.0 | 0.0 | 999.0000 | Globe Postpaid | outlier_205.5.0 | 270.401325 | 0.000000 | 0.000000 | 999.000000 | 0.000000 | 0.000000 |
| 1670992 | J41EATby/kOcAN5sPkr8gJqJ7LzAuWRl | 3423.890440 | 1650.0 | 1020.0 | 1499.0000 | Globe Postpaid | outlier_205.8 | 45.761895 | 1211.403834 | 55.505339 | 52.257632 | 1383.358247 | 63.384122 |
| 1670993 | J41EATby/kOcRuBqMASLhpqI3pqauWRl | 20790.104865 | 6.0 | 1.0 | 4917.3842 | Globe Postpaid | outlier_205.1 | 1571.105026 | 21.982257 | 0.065376 | 4849.332541 | 67.849872 | 0.201787 |
| 1670994 | J41EATby/kOcD5NqMDuPppqL3rLRj2Rl | 1872.329191 | 1441.0 | 108.0 | 599.0000 | Globe Postpaid | outlier_205.15 | 23.897468 | 519.817750 | 0.837042 | 26.286886 | 571.792379 | 0.920735 |